46 research outputs found
Large-Scale Multi-Label Learning with Incomplete Label Assignments
Multi-label learning deals with the classification problems where each
instance can be assigned with multiple labels simultaneously. Conventional
multi-label learning approaches mainly focus on exploiting label correlations.
It is usually assumed, explicitly or implicitly, that the label sets for
training instances are fully labeled without any missing labels. However, in
many real-world multi-label datasets, the label assignments for training
instances can be incomplete. Some ground-truth labels can be missed by the
labeler from the label set. This problem is especially typical when the number
instances is very large, and the labeling cost is very high, which makes it
almost impossible to get a fully labeled training set. In this paper, we study
the problem of large-scale multi-label learning with incomplete label
assignments. We propose an approach, called MPU, based upon positive and
unlabeled stochastic gradient descent and stacked models. Unlike prior works,
our method can effectively and efficiently consider missing labels and label
correlations simultaneously, and is very scalable, that has linear time
complexities over the size of the data. Extensive experiments on two real-world
multi-label datasets show that our MPU model consistently outperform other
commonly-used baselines
Tensor-Based Multi-Modality Feature Selection and Regression for Alzheimer's Disease Diagnosis
The assessment of Alzheimer's Disease (AD) and Mild Cognitive Impairment
(MCI) associated with brain changes remains a challenging task. Recent studies
have demonstrated that combination of multi-modality imaging techniques can
better reflect pathological characteristics and contribute to more accurate
diagnosis of AD and MCI. In this paper, we propose a novel tensor-based
multi-modality feature selection and regression method for diagnosis and
biomarker identification of AD and MCI from normal controls. Specifically, we
leverage the tensor structure to exploit high-level correlation information
inherent in the multi-modality data, and investigate tensor-level sparsity in
the multilinear regression model. We present the practical advantages of our
method for the analysis of ADNI data using three imaging modalities (VBM- MRI,
FDG-PET and AV45-PET) with clinical parameters of disease severity and
cognitive scores. The experimental results demonstrate the superior performance
of our proposed method against the state-of-the-art for the disease diagnosis
and the identification of disease-specific regions and modality-related
differences. The code for this work is publicly available at
https://github.com/junfish/BIOS22
A Comparison of Image Denoising Methods
The advancement of imaging devices and countless images generated everyday
pose an increasingly high demand on image denoising, which still remains a
challenging task in terms of both effectiveness and efficiency. To improve
denoising quality, numerous denoising techniques and approaches have been
proposed in the past decades, including different transforms, regularization
terms, algebraic representations and especially advanced deep neural network
(DNN) architectures. Despite their sophistication, many methods may fail to
achieve desirable results for simultaneous noise removal and fine detail
preservation. In this paper, to investigate the applicability of existing
denoising techniques, we compare a variety of denoising methods on both
synthetic and real-world datasets for different applications. We also introduce
a new dataset for benchmarking, and the evaluations are performed from four
different perspectives including quantitative metrics, visual effects, human
ratings and computational cost. Our experiments demonstrate: (i) the
effectiveness and efficiency of representative traditional denoisers for
various denoising tasks, (ii) a simple matrix-based algorithm may be able to
produce similar results compared with its tensor counterparts, and (iii) the
notable achievements of DNN models, which exhibit impressive generalization
ability and show state-of-the-art performance on various datasets. In spite of
the progress in recent years, we discuss shortcomings and possible extensions
of existing techniques. Datasets, code and results are made publicly available
and will be continuously updated at
https://github.com/ZhaomingKong/Denoising-Comparison.Comment: In this paper, we intend to collect and compare various denoising
methods to investigate their effectiveness, efficiency, applicability and
generalization ability with both synthetic and real-world experiment
Recommended from our members
Joint analysis of three genome-wide association studies of esophageal squamous cell carcinoma in Chinese populations
We conducted a joint (pooled) analysis of three genome-wide association studies (GWAS) 1-3 of esophageal squamous cell carcinoma (ESCC) in ethnic Chinese (5,337 ESCC cases and 5,787 controls) with 9,654 ESCC cases and 10,058 controls for follow-up. In a logistic regression model adjusted for age, sex, study, and two eigenvectors, two new loci achieved genome-wide significance, marked by rs7447927 at 5q31.2 (per-allele odds ratio (OR) = 0.85, 95% CI 0.82-0.88; P=7.72x10−20) and rs1642764 at 17p13.1 (per-allele OR= 0.88, 95% CI 0.85-0.91; P=3.10x10−13). rs7447927 is a synonymous single nucleotide polymorphism (SNP) in TMEM173 and rs1642764 is an intronic SNP in ATP1B2, near TP53. Furthermore, a locus in the HLA class II region at 6p21.32 (rs35597309) achieved genome-wide significance in the two populations at highest risk for ESSC (OR=1.33, 95% CI 1.22-1.46; P=1.99x10−10). Our joint analysis identified new ESCC susceptibility loci overall as well as a new locus unique to the ESCC high risk Taihang Mountain region
Psychometric Properties of Multi-Dimensional Scale of Perceived Social Support in Chinese Parents of Children with Cerebral Palsy
The Multi-dimensional Scale of Perceived Social Support (MSPSS) is one of the most extensively used instruments to assess social support. The purpose of this research was to test the reliability, factorial validity, concurrent validity and measurement invariance across gender groups of the MSPSS in Chinese parents of children with cerebral palsy. A total of 487 participants aged 21–55 years were recruited to complete the Chinese MSPSS and Parenting Stress Index-Short Form (PSI-SF). Composite reliability was calculated as the internal consistency of the Chinese MSPSS and a (multi-group) confirmatory factor analysis (CFA) was conducted to test the factorial validity and measurement invariance across gender. And Pearson correlations were calculated to test the relationships between MSPSS and PSI-SF. The Chinese MSPSS had satisfactory internal reliability with composite reliability values of more than 0.7. The CFA indicated that the original three-factor model was replicated in this specific population. Importantly, the results of the multi-group CFA demonstrated that configural, metric, and scalar invariance across gender groups was supported. In addition, all the three subscales of MSPSS were significant related with PSI-SF. These findings suggest that the Chinese MSPSS is a reliable and valid tool for assessing social support and can generally be utilized across sex in the parents of children with cerebral palsy
HBcompare: Classifying Ligand Binding Preferences with Hydrogen Bond Topology
This paper presents HBcompare, a method that classifies protein structures according to ligand binding preference categories by analyzing hydrogen bond topology. HBcompare excludes other characteristics of protein structure so that, in the event of accurate classification, it can implicate the involvement of hydrogen bonds in selective binding. This approach contrasts from methods that represent many aspects of protein structure because holistic representations cannot associate classification with just one characteristic. To our knowledge, HBcompare is the first technique with this capability. On five datasets of proteins that catalyze similar reactions with different preferred ligands, HBcompare correctly categorized proteins with similar ligand binding preferences 89.5% of the time. Using only hydrogen bond topology, classification accuracy with HBcompare surpassed standard structure-based comparison algorithms that use atomic coordinates. As a tool for implicating the role of hydrogen bonds in protein function categories, HBcompare represents a first step towards the automatic explanation of biochemical mechanisms